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Abstract 

The d-dimensional Hamming torus is the graph whose vertices are all of the integer points 
inside an a\n x 0,211 x • ■ ■ x adn box in R d (for constants 01, . . . , a<j > 0), and whose edges connect 
all vertices within Hamming distance one. We study the size of the largest connected component 
of the subgraph generated by independently removing each vertex of the Hamming torus with 
probability 1 — p. We show that if p = «, then there exists A c > 0, which is the positive root of 
a degree d polynomial whose coefficients depend on 01, . . . , aj, such that for A < A c the largest 
component has O(logn) vertices (a.a.s. as n — > 00), and for A > A c the largest component has 
(1 — q)\ (Yli a i) n d ~ 1 + o(n ) vertices and the second largest component has O(logn) vertices 
(a.a.s.). An implicit formula for q < 1 is also given. Surprisingly, the value of A c that we find 
is distinct from the critical value for the emergence of a giant component in the random edge 
subgraph of the Hamming torus. Additionally, we show that if p = , then when c < jfi 

the site subgraph of the Hamming torus is not connected, and when c > the subgraph is 
connected (a.a.s.). We also show that the subgraph is connected precisely when it contains no 
isolated vertices. 

1 Introduction 

Erdos and Renyi studied random subgraphs of the complete graph in [5] (for an account of their 
results, see [5]). In the Erdos- Renyi model, a random subgraph of the complete graph on n nodes is 
obtained by independently deciding whether to remove each edge with probability (1 — p) or keep 
it with probability p. Erdos and Renyi studied the size of the largest connected component in the 
random subgraph under the scaling p = A/n, where A is a constant parameter. They found that 
the size of the largest connected component is asymptotically almost surely O(logn) for A < 1, and 
cn + o(n) for c = c(A) > when A > 1. A sequence of events E n is said to occur asymptotically 
almost surely (a.a.s.) if P (E n ) — > 1 as n — > 00. Thus, at A = 1, the Erdos-Renyi model undergoes a 
transition from having small components to having a giant connected component. 

Random subgraphs generated by randomly removing edges independently with probability (1— p) 
have since been extensively studied on other graphs G = (V,E), and we will refer to random 
subgraphs generated in this manner as random edge subgraphs. Alternatively, random subgraphs 

* Dissertation research at UC Davis supported in part by NSF VIGRE Grant No. DMS-0636297, and NSF Grant 
Nos. DMS-0805970 and DMS-0505734. 



1 



of G can be obtained by independently removing vertices (and any edges incident to those vertices) 
with probability (1 — p); we will refer to these as random site subgraphs. 

The n-dimensional hypercube is an example of a graph for which both random edge and site 
subgraphs have been studied. Ajtai et al. [T] studied random edge subgraphs of the n-dimensional 
hypercube: V = {0, 1}™, E = {(x, y) e V x V : d(x,y) — 1}, where d(x,y) is the Hamming distance 
between x and y (the number of coordinates in which they differ). For the scaling p = X/n, they 
proved that all components of the random edge subgraph have 0(n) vertices when A < 1 (a.a.s.), 
and for A > 1 there is a component with c2" + o(2 n ) vertices (a.a.s.) for c = c(A) > 0. Bollobas et. 
al. [5] found similar behavior for the random site subgraph of the n-dimensional hypercube. Under 
the same scaling, they proved that for A < 1 all components have at most 0(n) vertices (a.a.s.), and 
for A > 1 there is a component with cn~ 1 2 n + o(n~ 1 2 n ) vertices for c = c(A) > (a.a.s.). Thus, both 
random subgraph models have the same threshold under the scaling p = X/n for the n-dimensional 
hypercube. In fact, the value of c(A) is the same for both models, so even the proportion of vertices 
in the giant component above the threshold is the same. 

The intuition behind each of these results is that random edge subgraphs of the hypercube and 
the complete graph and random site subgraphs of the hypercube look locally tree-like. So, the 
connected component associated with a fixed vertex can be compared to a branching process with 
a Binomial(Z), p) offspring distribution, where D is the degree of each vertex. A giant connected 
component exists precisely when the branching process has positive survival probability, which is 
when the expected number of offspring per individual (Dp = X) exceeds 1. Borgs et al. [3 [7] 
showed that, for a wide variety of finite transitive graphs, the threshold for the existence of a giant 
component in the random edge subgraph should be at Dp = 1, though they focus on behavior in 
the critical window, and do not give a lower bound for the size of the supercritical giant component. 
The case of random site subgraphs is less clear — just consider the complete graph. All random 
site subgraphs of the complete graph are connected, thus the threshold for the existence of a giant 
component is 0. This is a trivial case, but suggests that the random edge and site subgraphs will 
behave differently when the edge density of G is closer to that of the complete graph than to the 
hypercube. 

Random site subgraphs are particularly relevant to problems that arise from the theory of genetic 
fitness landscapes. Fitness landscapes are a tool used in theoretical evolutionary biology to model 
speciation under various conditions [10]. A fitness landscape consists of a genotype space and a 
function that maps each combination of genes to a fitness level in the interval [0,1], which can 
be interpreted as the probability of an individual with that genotype surviving to reproduce. For 
example, the genotype space for a haploid genome with n diallelic loci (two gene alternatives at each 
position along the genome) can be represented by the n-dimensional hypercube, where each vertex 
represents a genotype and edges represent single-gene mutations. Each genotype is then assigned a 
fitness level of 1 with probability p or with probability (1 — p), independent of all other genotypes 
— this is equivalent to taking a random site subgraph. For this model, provided n is sufficiently 
large, only a small proportion of genotypes need to be viable (p > 1/n) for a significant number of 
viable genotypes to be accessible via single-gene mutations. 

The random graph model that wc are interested in comes from a genotype space for a haploid 
genome with d loci, and a^n possible alleles at the i th locus. Here d is fixed, and the number of 
alleles at each locus is assumed to be large. The resulting graph is a Hamming torus in d dimensions: 

V = {(xi, . . . ,x d ) € 1 d : 1 < x 4 < a t n, i = l,...,d} 
E={{x,y)eVxV:d(x,y) = l}, 

where d(x, y) is the Hamming distance between x and y. For d — 1 this is the complete graph on a,\n 
vertices, and for d = 2 this is the rook graph on an a\n by a^n chessboard. As before, fitness levels 
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of 1 are assigned independently with probability p, and fitness of is otherwise inferred. We say 
that sites with a fitness of 1 are occupied, while those with a fitness of are removed or unoccupied. 

This model is amenable to comparison with a multitype branching process. Consider a multitype 
branching process with d types in which an individual of type i has a Binomial(ajn, ^) number of 
offspring of type j for j ^ i and zero offspring of type i. This branching process has nonzero survival 
probability iff the largest eigenvalue of the matrix of expected progeny, M\, is strictly larger than 
one 0. 
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where the entry in position is the expected number of type j offspring to which a type i 

individual will give birth. This branching process is a good local approximation for clusters in the 
random site subgraph of the Hamming torus when p = — , which motivates the following theorems. 

Theorem 1.1. Fix d > 2 and ai,...,a<j > 0, and let X c be the unique positive solution to 
det {M\ a — I) = 0. If X < X c and p = ^ then the largest connected component of the random 
site subgraph of the Hamming torus is a.a.s. O(logn). 



Theorem 1.2 
I) 



Fix d > 2 and a\ , . 
= 0. If X > X c and p 



,a d > 0, and let X c be the unique positive solution to 
^ then the size of the largest connected component of the 
random site subgraph of the Hamming torus is a.a.s. (1 — q)X (Y[ i dj) n' 1 " 1 + o(n <i ~ 1 ). Furthermore, 



det (M Xc 

. si 

the second largest component has a.a.s. O(logn) vertices. 



For Theorem 1.2 (1 — q) > is defined precisely in Section 3.4 and in both theorems J is the 
d x d identity matrix. Note that the total number of vertices in the random site subgraph of the 
Hamming torus is asymptotically the expected number, p\V\ — X (J\ i Oi) n 1 , so (1 — q) is the 
proportion of vertices remaining that are in the giant component. The critical value, A c , can be 
defined equivalently as the root of a degree d polynomial due to the following lemma. 

Lemma 1.3. 



det(M A - I) = (-1 ) d 
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Random edge subgraphs of the Hamming torus were considered by Borgs et al. [SI [7] , who indi- 
cated that the threshold for the emergence of a giant component in the random edge subgraph should 
be l/(ai + • • • + a,d), though they do not provide a lower bound for the size of the largest component 
above the threshold. This lower bound was later proven by van der Hofstad and Luczak |12j in the 
case d = 2 and a\ = a 2 = 1, demonstrating that the threshold occurs at 1/2. When d — 2, the 
threshold for the random site subgraph is A c = 1/ \Ja\CL2, so the two processes clearly differ. 

Multitype branching processes have also been employed in the analysis of inhomogeneous random 
edge subgraphs of the complete graph by Bollobas et al. [4 1. In the inhomogeneous model, the 
probability of retaining the edge between vertices i and j is py, and the p-y's may not be equal, 
however the inclusion or exclusion of each edge still occurs independently of all other edges. Since 
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the number of neighbors that a vertex has depends on the probabilities of the edges incident to that 
vertex, the inhomogeneous model looks locally like a multitype branching process — each vertex 
clearly has a different offspring distribution, and is thus of a different "type". Bollobas et al. [4] 
proved that, under certain conditions on the values of the p^'s (which exclude the case of the 
Hamming torus), a giant component exists precisely when the corresponding multitype branching 
process survives. In the case of random site subgraphs of the Hamming torus, it is somewhat 
surprising that a connection to multitype branching processes arises, since each vertex sees the same 
distribution of neighbors. In this model, the similarity to a multitype branching process is due to 
the dependencies between edges, and not the inhomogeneity of the edge probabilities as in [4]. 

In Section [2] we prove Theorem 1 1. 1 1 by coupling a process of revealing the vertices in a connected 
component of the random site subgraph with a multitype branching process. In Section [3] we prove 



Theorem 1.2 in four steps. The first step, which is in Section 3.2 is to show that the process of 
revealing the vertices in a connected component of the random site subgraph will either terminate 
before discovering O(logn) occupied vertices, or will reach size m = 0(logn) with high probability. 
The second step, in Section |3.1| is to show that if the process of revealing vertices reaches size 
771, then it can be coupled with a lower bounding branching process, which will reach size n d ~ 4 / 3 
with high probability. The third step, in Section [3. 3| is to show that any two component-discovering 
processes (started from two different vertices in V) that reach size n d ~ 4 ^ 3 will join together with high 
probability. The final step, in Section [3.4| is to show that the proportion of vertices in components 
of size O (log n) converges to q in probability. The reason we reverse the order in which the first two 
steps are presented is because m needs to be defined before proving the first step, but this definition 
is motivated by the second step. In Section [4] we will prove the following three theorems regarding 
the connectivity of the random site subgraph. 

Theorem 1.4. Let c < |=j^t- If P = p(n) < c '° s " and p = w(n~ d ) then the random site subgraph 
of the Hamming torus contains isolated vertices, and is thus not connected (a.a.s.). 

Theorem 1.5. Let c > yT 1 . If p = p(n) > c - then the random site subgraph of the Hamming 
torus is connected (a.a.s.). 

Theorem 1.6. Fix a x > a 2 > • • • > a d , and let c > 2 ^-.d d ^ +a . If p = cl ° s " then every 
vertex in the random site subgraph of the Hamming torus is either isolated or belongs to the giant 
component (a.a.s.). 

These three theorems together imply that, with probability approaching one, the random site 
subgraph of the Hamming torus is connected if and only if it contains no isolated vertices (except 
in the trivial case p x n , where the subgraph may consist of just a single occupied vertex with 
positive probability). 



2 Subcritical behavior 



Theorem |1.1| comes from a direct comparison with the binomial multitype branching process de- 
scribed above, and thus follows from Proposition 2.2 below, which bounds the total size of a multi- 
type branching process. Consider a multitype branching process with d types, and let Zq, Zi, Z2, ■ ■ ■ 
be the generations of this process, where Z t is a vector whose i th component 



Z\, is the number of 



type i individuals in the t generation. Z a is assumed to be deterministic, and all birth events are 
independent of one another. Let M = (my) denote the matrix of expectations: 



E 
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We say that a matrix, A, or a vector, x, is positive if Ay > for all i and j or a;, > for all z. 



Lemma 2.1 is a simplified restatement of the Perron- Frobenius Theorem as it appears in [2; 



Lemma 2.1. If M N is positive for some natural number N then M has a positive simple eigenvalue, 
p, that is greater in absolute value than any other eigenvalue, and p corresponds to a positive right 
eigenvector fi : Mfi = pfx. 



We wish to state Proposition |2.2| in some generality, but we will only be applying it to the 
multitype branching process with expectation matrix M\. When d > 3, it is easy to check that 
M. \ is positive, and when d = 2 it can be verified directly that M\ has a unique positive eigenvalue 
corresponding to a positive right eigenvector. 

Proposition 2.2. // Mfi = pfj, where < p < 1 and fi is positive, and there exists 9q > such 
thatE [e e °< Zl '^|Zo = e-] < oo for i = 1, d, thenW(J2tLo \\ z t\\i > x ) < Ce~ ax where a,C > 
can be chosen as follows: 



mm max 

9e[0,9 ] i 



E 



Z = ei 



and, letting 8' be the value of 9 for which this minimum is attained, C := e' 



'(Zo,iA 



Proof of Proposition 2.2 Suppose X > is an arbitrary random variable such that Ee SlX < oo for 
some #o > 0. By expanding, for 9 < 9 n : 



d 
d9 



'Ee ex ) 



Thus, for each i = 1, . . . , d, 
d 



d9 



E 



10=0 



^- (l + 9 EX + 1-9 2 EX 2 
d9 V 2 



= EX. 



9=0 



Z = ei 



9=0 



= E[{Z 1 ,(i)\Z = = pm 



(1) 



Now consider a random walk version of the multitype branching process, (St), constructed as fol- 
lows. Sq = Zq is a non-random initial vector whose i component indicates the number of type 
i individuals who are active. At each step, an active individual is chosen uniformly at random, it 
gives birth to a random number of individuals depending on its type and according to the law for 
that type in the branching process (these new individuals are considered active), then it is made 
inactive (thus no longer included in S t ). 



St+i — St 



i=l 



[ X t+1 - e i 



(2) 



Where j t is the random variable that takes the value i if an active individual of type i is selected 
at time t, so P (jt+i = % \ St — v) — Vi/ Wv^, and for each i = 1, ...,d the random vectors X\ are 
independent and equal in distribution to Z\ conditional on = ei. Notice that jt+\ is dependent 
on St, but X\ +1 is not. The process continues until the stopping time T := inf {t \ St = 0}, at 
which time the process dies out. Notice that T = J2t=o ll^tllr 
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Now consider an increment of this process: 



cj> v {6) :=E 



exp (o l{,' t+1 =i} (Xl+i - e<) , 



< max Ee e (( xl i- e >)^> 
=: ^(0). 



(3) 



Note that the last expression does not depend on the previous state vector, v. By equation Q, for 
each i = 1, . . . , d we have: 
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Since p < 1 and /ii > for all i, we have that there exist 0',a > such that ip(6') = e a < 1. The 
following computation shows that e e ^ St '^ / -0 (6 1 )* is a supermartingale, and uses the estimate in O). 
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St] = E 



<M#) 
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So, by the Optional Stopping Theorem for positive supermartingales, 

' e 6'(S T ,n) 



AT 



= Ee QT 



then by Markov's inequality: 



(T > x) = P (e QT > e c 
< Ce- Q2; . 



□ 



Proof of Theorem \l.l\ Consider a fixed vertex, v, in the Hamming Torus, and let C v be the cluster 
containing that vertex (C v = if the vertex v is unoccupied). We can reveal the vertices in C v 
using the following cluster-discovering algorithm: 

1. Initialize the sets R = 0, A = {v}, U = V \ A . 

2. Choose a vertex, v t , from A t in some measurable way (e.g., lexicographically or uniformly at 
random) . 

3. A t+ i = A t \ {v t } U {w e Af{v t ) n U t | w is occupied} 

4. U t+1 = U t \Af(v t ) 

5. R t+1 =R f U {v t } 

6. If A t+ i is empty, then return C v = Rt+i, otherwise increment t and go to step 2. 
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In the above algorithm, Af(v) = {w G V \ d(v, w) = 1} is the neighborhood of vertex v. To compare 
this process with a multitype branching process, we can say that at step 3, a neighbor of v t , say 
w : is of type i if v t — w — mei for some integer m. In future iterations of the algorithm, a type i 
individual can only yield offspring of types j € {1, ...,<£} \ {i}, since all of its neighbors in the ej 
direction will have been removed from Ut+i- Thus, \C V \ is bounded above (in distribution) by the 
size of a multitype branching process with Zf distributed as a Binomial(ajn, -) random variable for 
j 7^ i and Z\ = when Zq = ej. The expectation matrix for this branching process is M\, and 
the largest eigenvalue of M\ is less than 1 iff A < A c . In this regime, we have that the moment 
generating functions for ((Z± — ej, fx) | Zq = ej) arc uniformly (in n) bounded above for 9 > 0: 



E 



9<Z 1 -e ijAt ) 



Zq = ei 



= e~ 9 ^ 



. / U/ \ n 



n e 



„0k. 



n 



A 



l + -{e e ^ -1) 



< e- efii ]Jexp [Aoj-(e°w - 1)] 

It is easy to verify that ^i(O) = - 1) < 0, V-'j(O) = 1, ^i{9) —■ 
all > which imply that we can choose a, C > such that 



oo as 



oo, and i>'({e) > for 



e a := minmax tpi(9) =: max ip i (9'), 

0>Q i i 



C-exp^llMlU- 

Now applying Proposition |2.2| to this branching process yields the following inequality: 



This implies the desired result: 



max|C„|> — — — log n ) < C(ai • • • a^n 1 . 



□ 



3 Supercritical behavior 



Theorem 1 1 . 2 1 shows that A c is the critical threshold for the emergence of a giant component. If we 
let Qi = P (Zt = for some t | Zq = ej) be the extinction probabilities for the multitype branching 
process in which an individual of type i gives birth to a Poisson(Aa.,) number of type j individuals 
for j ^ i and zero individuals of type i, then the proportion of occupied vertices in the largest 



component when A > A c is 1 — (Yii Qi) 



i/(d-i) 



l-g>0. 



Theorem (1.2 restated). Fix d > 2 and a\ 

positive solution to det(M \ c — J) = 0. If A > A 



> a,2 > ■ • • > a f z > 0, and Zef A c &e ifte unique 
V = \ then the size of the largest connected 



component of the random site subgraph of the Hamming Torus is a.a.s. (1 — ?)-^(IIi a « 
o(n d_1 ). Furthermore, the second largest component has a.a.s. O(logn) vertices. 



7 



Before beginning the proof, it will be useful to show that the matrix equation defining A c is equivalent 
to a polynomial equation that is easier to understand. 



Lemma (|1.3| restated). 



det(M A - I) = (-1) 



1=2 sc{i,...,d} ies 

ISM 



(4) 



From this reformulation, it is easy to see that a positive solution to det(M\ — I) = exists and is 
unique, since the polynomial in the above expression is monotone increasing for odd d (monotone 
decreasing for even d) and A > 0, and the value at A = is (— 

Proof of Lemma \1.3\ To demonstrate the validity of equation Q, we will first make a change of 
variables by letting 7 = 1/A, then multiplying both sides by j d , so the equation we must verify is: 
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(5) 



We will proceed by induction on d. For d = 1 both sides of equation ^ are equal to —7. For d > 1, 
we will start by showing that the derivative with respect to 7 on both sides of equation ([5]) is the 
same. Taking the derivative of the left hand side yields: 
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which is the derivative of the right hand side of equation (JHJ. The second line follows from the 
inductive hypothesis. The last line results from counting the number of values of j such that, given 
a fixed set 5 c {1, ... , d} of size £, S is also a subset of {1, . . . , d} \ {j} of size I. This is the same 
as counting the number of values of j such that j S, which is (d — £). Now that we have shown 
the derivatives to be equivalent, to verify equation |5| we only need to show that it holds for 7 = 0. 
To this end, we can factor the matrix on the left hand side of ^ when 7 = as: 
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Since all of these matricies are triangular, it is clear that the determinant on the left hand side of 
([5]) when 7 = is (— l) d ^ 1 (d — l)Yli a i: which is equal to the right hand side of ^ when 7 = 0. 
Thus, we have demonstrated equation Q. □ 



Proof of Theorem \1.2\ We begin by considering C v , the cluster containing a fixed vertex v, and 
reveal the vertices in C v using a similar algorithm to that described in the proof of Theorem 1, 
but with one noteworthy modification: We will remove some additional vertices at step 4 before 
they can be observed in any subsequent iterations. This will avoid the problem of generating closed 
loops, which would severely reduce the unseen neighborhoods of active vertices and make it difficult 
to couple the process with a lower-bounding branching process. The cluster-discovering algorithm 
is then: 

1. Initialize the sets Ro = 0, A = {v}, Uq = V \ A . 

2. Choose a vertex, v t , from A t in some measurable way (e.g., lexicographically or uniformly at 
random) . 

3. A t+ i = A t \ {v t } U {w e N{v t ) n U t I w is occupied} 

4. U t+1 = (U t \Af(v t ))\{w e M(u) nM(v) | u,v € A t+1 ,uyt v} 

5. R t+1 =R t U {v t } 

6. If A t+ i is empty, then return Rt+i, otherwise increment t and go to step 2. 

We will need to keep track of the number of vertices of each type, so let denote the number of type 
i vertices in A t , let At = ^4^, • ■ ■ , A ^^j , and let J\f' l (v) = {w e V \ v — w = mei for some m}. 



As explained in the proof of Theorem 1.1 a vertex, w, discovered at time t is labeled type i if 
w E Af l (v t ). That is, a vertex is of type i if it is discovered by searching neighboring vertices in the 
direction of the i th basis vector. 

The above algorithm will give a lower bound on C v , but we will make use of an upper-bounding 
branching process, as in the proof of Theorem to tightly control the size of C v . We can couple 
the random walk version of the upper-bounding branching process with *A t by using the same 
random variables in the construction of St as in the construction of A-t whenever possible, and 
when independence is an issue, we just add independent copies of these random variables to St- 
More rigorously, let be the random variable that takes the value 1 if the vertex v is occupied and 
is otherwise. Then we can define St iteratively for each i: 

a t n-\j^(v t )nUt\ 

St+x = S t % + ^ £ w + ^ Vk ~ ^{et is of type i}, (6) 

w^M i {v t )r\U t fc=l 

where the rj^' 1 ^ are i.i.d. Bernoulli(p) random variables, and are independent of the As long as 
A-t ^ 0, then we can choose the same focal vertex v t for both processes. But, if At = and St 7^ 
then we can just choose a vertex from the process St in some measurable way. 

3.1 Existence and Survival of a Lower-bounding Random Walk 

We will make use of this upper bounding random walk in a moment, but first we also need to 
introduce a lower bounding random walk - also based on a branching process. We will use the upper 
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bounding random walk to grow a sufficiently large cluster for the lower bounding walk to take over, 
but first we will need to know what "sufficiently large" means. Also, we need to know that such a 
lower bound exists, so we begin there. Ideally, we would like to have a lower bound on \U r C\N l (v)\ 
for some large enough r. Using the Erdos-Renyi model as guidance, we need n^ 1 ^ 2 «r« 
so we choose r = n d ~ 5 . 

To generate the lower bounding random walk, we will begin by letting p = pi + P2 — P\Pi , 
where p\ — ^ and p = ^. We want to choose Ai < A so that Ai > A c , but Ai is smaller than 
the positive solution to det(M^~ 1 ^ — = o, where TW^ -1 -* denotes the submatrix of M\ 

consisting of the first (d — 1) rows and columns. Call this solution \i d ^ In other words, we want 
Ai to be supercritical for the d-dimensional process, but subcritical for the process restricted to any 
(d — l)-dimensional plane. By the monotonicity of the process of growing a cluster, the critical value 
for the (d — l)-dimensional process obtained by considering the first (e? — 1) dimensions is smallest, 
since we assumed that a\ > a-i > • • • > a d > 0. The critical value for this process is X^ d , so to 
guarantee the existence of such a Ai, we just need to verify that A c < xi d ~ 1 ' . This is easiest to see 
by using equation (El). The values of det(iVf A (d-i> — I) and det(Afo — I) have opposite signs, so by 



the Intermediate Value Theorem and the uniqueness of A c , we have that < A c < x{ d . 

Consider the process of discovering a cluster starting with m = O(logn) vertices (we will decide 



what m is exactly just before equation (13)) of arbitrary types in which each unseen vertex has 
probability p\ of being occupied; call this process A! t . The corresponding active set of vertices will 
be referred to as A' t , and we have assumed that \A' \ — m. These m vertices will ultimately come 
from running the cluster-discovering process up to some time s — /3 log n starting from a single 
vertex and using parameter p in Section |3.2| Conditioned on the survival of this process, we will 
have that |i?Q| = \R S \ = s, where R' t is the set of occupied vertices which have been removed from 
the active set up to time t when we start with m active vertices and use parameter p\. Also, let U' t 
be the set of unobserved vertices up to time t, and Uq = U s . Denote by 

:= {* e ^ I «*x = *i, • • • , «i» = 4} (7) 

the intersection of the vertex set, V, with the planes Vi ± = ix,...,Vi h = £k- Throughout this 
argument we will assume that 1 < . . . , i). < d, ij ± ^ ij 2 for j\ ^ j 2 , and £j € {1, . . . , ai n}, so 
we have no trivial constraints. We want to bound the number of vertices that become occupied up 
to time r in any 2-dimensional plane, V^' ■ This will give us the desired lower bound on 
\U' r n A/" J (V)| for each i and v' that appear in the cluster discovering algorithm. To get a bound 
on the number of occupied vertices in these 2-dimcnsional planes, we will work backwards by first 
getting a bound on the number of vertices that become occ upied up to time r in (d— l)-dimensional 
planes of the form V^\. In the statement of Lemma 3.1 we allow m and s to be larger than we 



will need for the proof of Theorem \1.2\ This is because we will also use Lemma |3.1| in the proofs of 
Theorems |1.5| and |1.6| where m and s will be slightly larger. 

Lemma 3.1. // \A' \ = m = o(n 2 / 3 ), \R' \ = s = o(n 2 / 3 ) and r = n d ~ 4 / 3 , then there is a constant 
Kd such that: 

P pi (\ V (t..£-2) n ( A 'r U R 'r)\ > K d n 2 ^{logn) d - 2 for some . . . ,i d _ 3 ) and (4, . . .,4- 2 )) 

= 0(n~<< d+1 '>). (8) 



Proof of Lemma \3J\ We begin with d = 2. In this case, equation ([8|) is simply the statement that 
at most Kir?! 3 occupied vertices have been discovered by time r = n 2//3 , so we merely need a large 
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deviation bound. At each step, we can discover at most a Binomial(etin, — ) number of occupied 
vertices. Thus, 



(\A' r U R' r \ > 2a 1 X 1 n 2 ^ < e -2a x X 1 r Ee \A' r uR! r \ 



< e 



-2ai Air+m+s 



l+^(e-l) 
n 



< exp [— 2aiAir + aiAi(e — l)r + m + s] 
= 0(n~ 3 ). 

In the first line above we exponentiated and applied Markov's inequality, in the second line we used 
that the total number of births up to time r is stochastically bounded by a Binomial(ainr, 
random variable, in the fourth line we used the bound 1 + x < e x , and in the last line we used that 
m+s = o(n 2 / 3 ) and e - 1 < 2. Thus we have fah for d = 2 with K 2 = 2a x Ai. 

Now, for d > 3, first consider a fixed plane Tv^) ■ ^ each step in the cluster discovering algorithm, 
either a vertex in this plane is chosen or a vertex in some other parallel plane is chosen. If a vertex 
in this plane is chosen, then if we restrict our attention to just this plane, that is we remove Af l1 (v' t ) 
from Af(v' t ) at step 3 of the algorithm, the resulting process will look like a subcritical branching 
process in (d — 1) dimensions, by the assumptions we made on Ai. Thus, restricted only to this 
plane, no vertex will give rise to a cluster larger than /5i log n vertices with at least probability 
(1 — n~ d ~ 2 ) (take j3\ = in the proof of Theorem 1). If, instead, a vertex in another plane 

is chosen, then this vertex may have a single opportunity to give rise to an occupied "seed" vertex 
in 7* fa). If the potential seed vertex has already been examined during a previous iteration of the 

algorithm, then it will not increase the number of seeds in vY' 1 } whether it is occupied or not. If the 
potential seed vertex has not been examined (it is in Ut), then it will be occupied with probability 
^r*. So by time r, the expected number of seeds is < ^p, and each seed is associated with a cluster 

restricted to Vfa) of size at most j3\ logn. If we let X r be the number of seed vertices generated in 
Vfa^ by time r, then X r is a sum of a random number of Bernoulli(^ L ) random variables. But X r is 

stochastically bounded above by X' r ~ Binomial(r, ^), since the number of potential seed vertices 
cannot exceed r. Using this fact, and applying a standard generating function argument shows that 
for 7i = 2 log 2 - 1 > 0, 

X r > 2^) , 

n 

,d-i 



n- d - 2 



Recalling that we have chosen r = n 3 , this implies that 

P ( Vfa 1 ] n {A' r U R' r ) > 2X 1 /3 1 n d -^ 3 logn) < e -^ Al "^ <7/3) 

and since the number of planes, "Pfa)i is < da^n, that 

V^J n (A' r U R' r ) > 2\ 1 p 1 n d - 7 / 3 logn for some (<i) or (4)) = 0(n~( d+1) ). (9) 

Now we wish to show that if every plane of the form "pj il '—' tk - 1 ^ nas a t most TV occupied vertices 

iyti . . . . ,tk — 1 ) 

by time r, then with high probability, every plane of the form ■p^ 1 '- ''^) nas a t most 2k\\—j3k logn 
occupied vertices by time r. The reasoning is similar to the argument above for 'Pfay Since no 
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A-i) 

,ih-l) 

P^ 1 ""^ \ pV u "'? h J has more than iV occupied vertices. Let us fix p^ 1 '-'^ an d denote by 

(i 1 ,...,i k )\(i j ) " (ii,...,ij_i,i J + i,...,i fc ) \ 

In one iteration of the cluster discovering algorithm, the focal vertex, vl, is either in P^ 1, '"'? h ), 
in 7'/' 1 '""'^\w^v for some 7 = 1, or neither. If it is in neither of these sets, then Af(vt) n 

^Ui'" «V) = ^' so * ms case does n °t contribute to the number of occupied vertices in this plane. If 
the focal vertex is in Pfa'"' , fj\, then we can restrict our attention to just this plane, and the process 
looks like a subcritical process in d— k dimensions (by our assumptions on Ai). That is, we remove 
U^ =1 Af' lj (v' t ) from M(v' t ) at Step 3 of the cluster discovering algorithm. Therefore, all vertices 
in -p^ 1 ' - ^' belong to clusters restricted to •p^ 1 '—^ f size at most /3fclogn with probability at 



plane of the form V/- 1 h \ has more than N occupied vertices, then certainly no set of the form 



least (1 - n -( d + fe+1 )) (take & = Finally, if the focal vertex is in Vi ■'"•!•},■ } for some 

j = 1, . . . , k, then it has precisely one neighbor in ' "f^- If this neighbor is unseen (is in U t ), 

then it has probability ^ of being an occupied seed vertex. There are at most kN occupied vertices 

in p,. 1 ' ' "'tyX. 3 ? for j = l,...,fc, so the number of seed vertices, Yh, is a sum of at most kN 

Bernoulli(~) random variables, and is thus stochastically bounded above by a Binomial (A: TV, ^) 
random variable. Thus, we find that 

Y*>2^ kN -)<e-^, 
n J 

where, again, 71 = 2 log 2 — 1. So, conditional on the event that no plane of the form 'Pfa'"'** - ^ 
has more than N occupied vertices by time r, we have that: 

p(t u ...,l h ) hafj > 2k\ 1 f3 k — \ogn occupied vertices by time r) < e ~ 7lAlfc » + n~ d ~ k ~ 1 . 

Since the number of planes, T 3 ^ 1 ' '"'/*?) is at most (^)a^n fe , and provided that A = u;(n 1+£ ) for some 
e > 0, we have: 

Any V^'""f^ has > 2k\ 1 p k — logra occupied vertices by time r j = 0(n~ (d+1) ) (10) 

conditional on the event that no plane of the form -p^ 1 ---- A- 1 ) more than N occupied vertices 
by time r. 

Finally, we can put together equations ^ and ( 10 ) to get: 

Any Vfa'""?*'*) has > K d n 2/3 (\ogn) d ~ 2 occupied vertices by time rj = 0(n~ (rf+1) ) 

where K d = 2 d ~ 2 ■ (d - 2)1 ■ \ d ~ 2 ■ (jjfzl ft) □ 

So with probability at least 1 — 0(n~( d+1 ^), for t < r and v t of type j ^= i, 

\U' t nN\v' t )\ > ai n- K d n 2 / 3 (log n) d ~ 2 
> di(l — 5)n 
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for sufficiently large n, where S > is such that (1 — <5)Ai > A c . This implies that, up to time r, our 
process of discovering occupied vertices is bounded below by the random walk version of a multitype 
branching process with expectation matrix M(i_g)Xi- Let us call the random walk version of this 

multitype branching process W t = (W t , . . . , W t ), and assume that Wo = A! . We can couple 
the processes W t and until time T A T$ A r, where 

To :=inf{t | W t = 0} , 

Tg :=inf {t | \U t fl A/' J (i'' t )| < a«(l — <5)n for of type j and some i ^ j} , 

w (i) 

by first choosing v' t as follows: first decide that v' t will be of type i with probability py^jj , then 

choose randomly a vertex from ^ that is of this type. Let 1Z S (v' t ) be the unseen neighborhood 
of v' t restricted to the first (lexicographically) aj(l — S)n elements of Ut fl J\f l {v' t ). Then we can 
complete the coupling of Wt with Al t as 

W/^ = W t W + £ C is of type i} (12) 

where £' w is the random variable that takes the value 1 if vertex w is occupied in the random site 
subgraph obtained using the probability parameter p±, and is otherwise. 

We wish to show now that this coupling will last until time r if to is large enough. This will 
occur whenever the branching process corresponding to Wt survives. Let 

q'i = P (Wt = for some t | W = e 4 ) 

for i — l,...,d. Since this branching process is supercritical, q[ < 1 for each i. If we let e -72 = 
maxi for 72 > 0, then 

P (Wt = for some t | WWq^ = m) < ( maxt/H 

So if to = log n, then the coupling between Wt and .4^ will last until time r with at least 
probability 1 - n~( d+1 \ Thus, conditional on \A' \ > to, \A' \ = o(n 2 / 3 ) and |i? | = o(n 2 / 3 ) we have 

¥ pi (\A' r \=0) = o(n-^). (13) 
3.2 Establishing a Set of m Active Vertices 

As in the proof of Theorem |1.1[ we make use of an upper-bounding branching process, but this time 
in a more intimate fashion. We couple the random walk version of the upper-bounding branching 
process, S t , with A± as in equation ([6]). Recall from equation ^ that we can write 

d 

S t +1 = 1 {h+l=i} ( X t+1 ~ e i) > 

i=l 
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where jt+\ is the random variable that takes the value i if v t is of type i (or, when «4. t = 0, if a type 
i individual is chosen at time t + 1 in the random walk process), and where the X\ are independent 
and distributed as Z\ conditional on Zo = e,. For this process (and later for the lower bounding 
process, W t ) we will need the following large deviations bounds. 

Lemma 3.2. Let St be the random walk version of a branching process with d types in which an 
individual of type i has a Binomial(ajn, — ) number of offspring of type j ^ i and zero offspring of 
type i. If M\fjb = p\x, where p > 1 and /x is the corresponding positive eigenvector normalized so 
that \\/J,\\i = 1, then for y < 1 < x 

F({S t -S ,i*)>x(p-l)ii max t) <e-" 14 (14) 

P ((S t - S , n) < y(p - l)p mm t) < e-^ (15) 

where p rna x '■= rnaxj(/ii), p m m '■= niirii(^j), and 771 , 772 > depend on x and y, respectively. 



Proof of Lemma \3Jq We will first prove inequality (14). First, as in the proof of Proposition (2.2 1, 
we consider an increment of the process (St — So, /x): 



MO) :=E 



exp 0(^l {jt+1 =i} ( X l+i - e i 



1 M 



S t = v 



S t = v) Ee^W+i-^)^) 



< max Ee 



i>(9). 



9((X*-e ( ),M) 



Using this estimate, which is independent of the previous state vector, v, we see that 



E 



,0<S t+ i, M > 



s *] = E 1 {s t =v}z e(v ' >l) MO) 



This implies that 

By equation ([I]), for each i = 1, . . . ,d 

d 

de 



< ip(0) e 
Ee e<s t -s ,M) < 



(16) 



9=0 



= (p- l)Mi 

< x(p- l)/i 
d 



df) 



)=0 



since p > 1 and ix > 0. Because ^(0) = 1, this implies that there exist #1,771 > such that 



Wi) e 



-$ix(p— 1)jU„ 



< 1. 



Thus, by Markov's inequality and inequality (16): 



V((S t - So, n) > x( P - l) Mmax i) = P ( e *(*-So,M> > e ^-(P-i)^ 

< E e £ 'i(' s t _s o,M> e -6i2;(p-l)/j max 
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So we have proven inequality ( 14 ). To prove inequality ( 15 ), we need the following estimate for each 
i = 1, . . . , d: 



d6 V 



6»=0 



"(p-l)Mi 

< -y(p- i)m 

d 



-By{p-l)n n 



dO 



0=0 



since p > 1 and /i. > 0. Because ip{0) = 1, this implies that there exist 02,112 > such that 



Thus, by Markov's inequality and inequality (16): 

P «S t - So, n) < y( P - l)/i min t) = P ( e -^(s*-So,#i> > e -fcv(p-i),w» t 

< Eg-e2(s t -s ,/x> e e 2a (p-i) Mmin 

< e- mt . 



We are now ready to prove the following lemma: 
Lemma 3.3. There exists a constant ft depending on a\ 



□ 



and 



where Klogn > to. 



, ad such that if s — (3 log n then 
P(0< HAIIi <m) = o(n~( d+1) ) . 

P (|| All! > Klogn) = 0(n-( d+1 >) . 



Proof of Lemma \37j\ We wish to apply Lemma [372] to St, but the first step of this process is not the 
same as the subsequent steps, so we must handle this case separately. Notice that in the first step 
of the cluster-discovering algorithm, the initial vertex does not have a type, and is free to search 
in any of the d directions, while all subsequent vertices are assigned types, and thus expand like 
a branching process. Thus, for each i = 1, . ..,<£, s| is distributed as a Binomial(ain, ^) random 
variable. We can thus bound this step of the process: 



P((Si-S 0) ju) > (d + 2)logn) 



e (S 1 -S .M> > e (d+2)log« 



< n 



O 



-{d+2) 



exp 



^ a,-A(e Mi - 1) - /i n 



(17) 



We only need this upper bound, since the first step of St is stochastically bounded below by any 
subsequent step, so inequality (151 still holds. We now apply Lemma 3.2 to S t at time s — /31ogn 
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with x = 2 and y —\ (and use the inequality in (17) to handle the first step) to yield 
P«S.-S ,M>> [2£(p-l)AW + (d+2)] logn) < + O (n^ d+2 ^ 

[S s - S ,fi) < \P{P- ^Mmin lognj < n-" 2 ' 9 . 

If we choose /3 > such that r?i/3 > d+1, 772/? > d+1 and /3 > ^ m °^+^ (recall that m = 4^ log n.), 
and we let if = 2,3( P -i)Aw*+(rf+2) + (3 + 1, then 

Mmin 

F (||S' S || 1 + s > if log n) < n-^+V (18) 

P f 115.1k < ^logn) < (19) 

To obtain the above inequalities, we used the estimates (S s — So,fi) > Mmin ||5 s |k — H^olloo, 
(S s -S 0> ti) < fhnaxWSsW-i and H-Soll^ //Xmin < log n for n sufficiently large (HSoiloo = !)• Equa- 
tion (18) implies the second part of the lemma, since H^lk < ||5 s |k- 
When IIS 1 ,,)^ + s < K log n, we have: 

\U S r\Af l (v)\ > can- K logn (20) 

for v of type j ^ i. This is because \A S \ < \R S \ < s (with equality here if At is still alive at 

time s), and the maximum number of neighbors that A s U R s can have in Af l (v) is \ A S U R s \ as long 
as v is not of type i. Therefore, provided A s ^ 0, the number of births that occur in St which are 
lost in At for t < s is at most 

s _ 1 d Oin-|t7 t rW*(i>t)| 

EE E vt } <y {s) (2D 

t=0 i=l k=l 

where Y^ s ' ~ Binomial (d/3K (log n) 2 , So, conditioned on ||5 s |k + s < K\ogn, we can bound 
the difference between S s and A s by a constant: 



I Alia > 0, \\s s - Alii > d + 2) < p (V s ) > d + 2) 

<f(logr>) 2 



d/3if(logn) 2 x 2 \ / \ \ fc / , \ d/3K(lo g n) 2 -k 

I dpK (log nf \ ( A\ I A x 



k=d+2 
d0K(logn) 

< E 

k=d+2 



k 

Ad/3if(logn) 



1 - 

n 



O (V< d+1 >) . (22) 



Finally, combining inequalities (18), (19), and (22) yields 



0< HAHi < log n- (d+2) J =0(n-( d+1 ') 

0< || Ash < ^ lo S«) = 0(V (d+1) ) 
P(0< H-4.ll! < m) = O (n- (d+1 ^ 



□ 
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This means that, with high probability, at time s the process has either died out or there are at 
least m active vertices (in A s ). 



3.3 Merging Clusters of Size r = n d 4 / 3 

At this point, with high probability, the cluster-discovering algorithm started at a given vertex, v, 
has either died out by time s = (3\ogn, or will continue to survive until at least time r — n d ~ 4 / 3 . 
If the process has died out, then the size of the cluster containing v is at most /31ogn. What we 
now intend to show is that, if the cluster containing the vertex V\ and the cluster containing the 
vertex v 2 each have at least r vertices, then Vi and v 2 are in the same connected component of the 
random site subgraph with high probability. 

The clusters containing v± and v 2 will be generated as follows (we use t>i/ 2 to denote "vi or v 2l 
respectively" , and likewise for similar notation) : 

1. Start the cluster-discovering algorithm, (R t , A t ,U t ), with parameter p at R — 0, Aq = v 1 / 2 
and Uq — V \ Aq. Continue until time s — /31ogn. 

2. Start the cluster-discovering algorithm (R' t , A' t ,U{.) with parameter pi at R' Q — R s , A' Q = A s 
and Uq — U s . Continue until time r = n d ~ 4 / 3 . 

3. Let T 1/2 =R' r \R s . 

Note that, conditioned on the survival of each process, | TC 1/2 1 = r, and T 1 / 2 C C„ 1/2 . 
Lemma 3.4. IfT± and T2 are defined for v± and v 2 as above and |Ti| = IT2I = r, then: 

¥ P2 (C V1 ^ C„ 2 ) = 0(n-^) 



Proof of Lemma 3.4 We will employ a sprinkling technique using the probability reserved in the 
first step, where the lower bounding process was defined using probability parameter p\ = \\jn. 
Observe that we can write p = p\ + p-2 — P1P2 1 where p — X/n and p 2 is defined in this way. The 
random site subgraph of G with parameter p, call it G p , can then be seen as G p = G Pl U G P2 , where 
G pi and G P2 are random site subgraphs of G that are independently generated, the union is taken 
over their vertex sets, and the bar denotes the inclusion of all edges from G between vertices in 
G Pl U G P2 . We will further subdivide p 2 in the same manner, but first it is crucial to note that: 

^ ~ ^1 . ^l(^-^l) 6 . ^.t _2\ 

P2 = + —7 ^ = ~ + 0(n ), 

n nyn — \\ ) n 

where e = A — X%. We can now write 

2d-4 2d-4 

^=E^- e p^+---+(-i) 2d - 3 n^ ( 23 ) 

t=l l<i<j<2d-4 i=l 

where p\ = eijn + 0(pT 2 ) and J>2% e i = e - This is done merely by repeatedly subdividing G p into a 
union of independent subgraphs as described above. Also, for the sprinkling argument to work as 
intended (so we can avoid dependencies), we will consider only the vertices in U s , since the vertices 
in V \ U s have already been fully considered for inclusion in G p . Since \U S n Af l (v)\ >ain~ Klogn 



whenever v € U s (see inequality (20)), this restriction will not significantly affect our estimates 
below. 
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We will first treat the cases d = 2, 3 separately, then use an induction argument to prove the 
Lemma for d > 4. 

Case 1: d = 2. We first observe that with high probability no row or column {P^j for i e {1, 2} 
and £ € {1, . . . , ciin}) contains more than log 2 n vertices in T 1 / 2 because: 



> log 2 n for some (i) and (£)j < P ^ P^ n G p | > log 2 n for some (i) and (£)J 



< 2a\n 



am , s 



< 2ainexp [— log 2 n + (e — l)aiA] 
= C^n 1 - 108 "). 

If no column ("P^j for I G {1, . . . , a\n\) has more than log 2 n vertices that are in T 1 / 2 , then by the 

Pigeonhole Principle there are at least r/log 2 n columns with at least one vertex in Ti/ 2 . Thus we 
have: 



P^jnA^Ofi^) 



< 



log 2 n 



(24) 



We denote by B P2 (T 1 / 2 ) the set of vertices in Af 1 (T 1 / 2 ) that are independently made occupied 
with parameter p 2 . Note that B P2 (Ti/ 2 ) contains independent copies of any vertices that have been 
previously observed. If there is an edge in G between Ti and T 2 then v± and v 2 are in the same 
component and we are finished. Otherwise, we seek to bound the probability that there is no edge 
in G between B P2 (T 1 ) and B P2 (T 2 ). If there is a column that contains a vertex from each of these 

two sets then the two sets are connected. Let Ei be the event that column £ {V^ ) contains a vertex 
from B P2 (Ti) and B P2 (T 2 ), and let E be the disjoint union of these events: 

E t = {p$ n s P2 (Ti) ^ 0} n {P§ n b P2 (t 2 ) ? 0}, 

ain 

E=\_\E e , 



We wish to bound the likelihood of the complementary event: 

P {E}) < 1 - fl - (1 - p 2 )" 2/3 / lo s 2 



< 1 - 

< 1 - 

<1- 

< exp 



1 — exp 



n 1 / 3 log 2 n 
e 2 



i 2 



i 1 / 3 log 2 n 2n 2 / 3 log 4 



2 n 2/3 } g 4 n 

r 2 



2 n 2/3 } g n 



In the second and fifth lines above we have applied the bound 1 — x < e x , in the third line we used 
1 — e~ x > x — x 2 /2 for x > 0, and the fourth line bounds the third for sufficiently large n (which we 
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denote by < ). Because the events Eg depend on disjoint columns of vertices, they are independent, 
so we have: 



¥{E C ) 




2 log 4 n 



(25) 



So, combining ( 24 1 and (|25j) , for d = 2 the probability that i>i and v^. are not in the same component 
is bounded above by O{rr logn ). 

Case 2: d = 3. We would like to begin by sprinkling vertices in orthogonal directions from each 
of the sets Ti and T 2 , but we would like to sprinkle in the directions that will give the greatest 
likelihood of adjoining the two sets. For this, we need the following fact about subsets of Z 3 . 

Lemma 3.5. Let A C Z 3 , and let A x , A y , and A z , respectively, be projections of A onto the 
coordinate planes, yz-plane, xz-plane, and xy-plane. Then 

\A\ 2/3 < \A X \ 1/3 \A V \ 1/3 \A Z \ 1/3 . 



Lemma 3.5 is a restatement of the main theorem of for d = 3, and it implies that at least one of 
the projections has size at least |A| 2 ^ 3 . Applying this to the set Ti, suppose without loss of generality 



(since the constants will not play a significant part here) that Vffl n A/" 3 (Ti) 



> r 2 ' 3 for all 



Throughout this section, we will also condition on the complement of the event in Lemma 3.1 which 
implies that no two-dimensional plane orthogonal to one of the coordinate axes contains more than 



i 2 / 3 logn vertices in T 2 . In particular, this implies that 



1 (2,3) n 12 



< C*n 2 / 3 log n for all (£ 2 ,4) 



This bounds the number of points that can map to the same point when the set T 2 is projected 



n A/" 1 (T 2 ) 



> 



C log n ' 



onto the span of and es, so the Pigeonhole Principle implies that 

We begin by sprinkling occupied vertices independently in the neighborhood Af 3 (Ti) with prob- 
ability p[ = €\jn and in the neighborhood A^ 1 (T 2 ) with probability p' 2 — e 2 /n. We will refer to 
the sets of newly added vertices (including those which may have already been present in G Pl ) as 
B 3 , (Ti) and B^, (T 2 ), respectively. The key observation here is that and V2 will be in the same 
component if M 2 {B 3 , (Ti)) n N 2 {Bl, (T 2 )) ^ 0. That is, if there exists a pair (£i,£ 3 ) such that 



(1,3) 



contains a vertex in B 3 , (Ti) and a vertex in (T 2 ), then and i>2 must be in the same 



the event that V^ 1 ^ contains a vertex in B 3 , (Ti) and a vertex in 



component. Denote by 

Bpi (T 2 ), and let E be the disjoint union of these events over all pairs (^1,^3): 

E (Ws) = ] n B P i(Ti) ^ 0} n {v^ n b p ,(t 2 ) ? 0}, 

ain a^n 

E = U U E (^y 
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We now define the following notation: 

fcg> := # {v 2 E {1, . . . , a 2 n} : « G V§> fl Ti j , 
fcg } := # {«a 6 {1, . . . , a 2 «} : « € pgf n T 2 } . 

In other words, k^ is the number of columns, V^ 1 ^ C pfl 1 ', which contain at least one vertex in 

Ti. Similarly, k^ is the number of rows, V^fcd C , which contain at least one vertex in T 2 

(see Figure lk. Note that fc^ = 0(n 2 ^\ogn) and k^ = 0(n 2 ^ 3 log n), since we are conditioning 
on the complement of the event in Lemma |3.1| 

^3 = h 



v 2 

► 

Vl 




Figure 1: Solid triangles and circles are vertices in Ti and T 2 , respectively. Empty triangles are 
neighbors of vertices in Ti in the ^3 direction, and empty circles are neighbors of vertices in T 2 
in the v% direction. Then ki is the number of triangles (solid or empty) in the column v% = l\ 



(e. 



0, k 



(i) 



( 2*1 

2), and k\ is the number of rows with circles in the plane W3 = £3 (here 

(2) 

kj, 3 — 2). Also, E(£ lt £ 3 } is the event that in the column v\ = l\, at least one triangle (solid or empty) 
is made occupied with parameter p[ and at least one circle (solid or empty) is made occupied with 
parameter p' 2 . 



We can now write the probabilities of E?^ ^ , and E c , conditional on the values of k^ and k 



„(2) 




=U 3 =i 




,„(2> X 



1- 1-1- 



c \ k {1) 
ei \ K t 1 



1-1- 



C2 
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and we wish to bound these quantities. To this end, we observe that for large n: 



1-1 



ei/2 
n 



ke 1/ 



> 



> 



fcei/2 _ / fegi/2 
n \ n 

fcei/2 
2n ' 



2 ft 

£ 

i=0 



fce 1/2 



(26) 



provided A: = o(n), which is the case for and fc^- Applying this inequality and 1 — x < e x 
gives: 



(E 



II II 

^1=1^3=1 



( 2 ), 



2/; 



2n 



< exp 



4n 2 



£ k ti £ 



(27) 



What remains now is to obtain lower bounds for the sums in the above expression. It follows from 
the definitions of fcj and kf^ that for any 1% 



> r 2 ' 3 = n w ' 9 . 



«i=i 



and for any 



> 



C log n ' 



Applying the last two lower bounds to (27 1, then handling the e xcep tional case where some plane 
contains more than Cn 2 ^ 3 logn vertices in Y1/2 by using Lemma 3.1 yields 



1 (E 



exp 



£ie 2 



An 2 \ C log n 



Case 3: d > 4. We will ultimately use the same argument here as in Case 2, but first we 
will sprinkle vertices in the directions e^, ed—i, ■ ■ ■ > 64 sequentially. At that point we will have 
llf=4 a i « d ~ 3 independent 3-dimensional Hamming Tori in which the two sets can connect. So that 
we can independently connect vertices to the sets Tx/2, we will write P2 as in equation (23) with 
k = 2d — 4. We will denote by B l p (J) the set of vertices that are independently made occupied with 
probability p in the neighborhood J\f l (J). 

We now begin by sprinkling vertices in the direction to generate the sets Bz, (Tx/2). By 

"1/2 ' 

conditioning on the complement of the event in Lemma 3.1 we have that for every . . . , td-i) 

< 1 ;:/-r ) l)nT i/2 <cn^{\o gn ) d - 2 . 
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This bound says that when the sets T]y 2 are projected onto the span of e±, . . . , e^—i, no more than 
this many vertices can be mapped onto the same point. Thus, by the Pigeonhole Principle, we have 
a lower bound on the size of the projection: 

n d-2 



7>SW d (T 1/2 ) 



> 



rr 



CnV 3 (\ogn) d - 2 C(logn) d - 2 



for any id- Conditional on this bound, we obtain the following bound on the the number of newly 

occupied vertices in Bi (T^) in a fixed (d — l)-dimcnsional Hamming torus: 

"1/2 



-,d-3 



< 



(log n) 



d-1 



< exp 



exp 



< exp 



n d-3 
(logn)^ 1 

n d-3 

(logn) d_1 
n d-3 



■ Eexp 



-Pld) nB kj T ^ 



'(1-e- 1 ) 



l _ !lZ 2 

n 



< 



exp 



(logn)^ 1 C^logn^- 2 



(logn 



id— 2 



for some constant 77 3 > 0. In the first line above we exponentiated and applied Markov's inequality. 
In the second line we used the moment generating function for a binomial random variable, and 
in the third line we used the bound on the size of the projection that we obtained above and the 
inequality 1 — x < e~ x . By the union bound, we have the following bound on the number of newly 
occupied vertices in each (d — 1) -dimensional Hamming Torus: 



3 l d s.t. 



2 (T 1/2 ) 



^d-3 



< 



(log n) 



< dan ■ exp 



n d-3 



-V' 3 



< 



exp 



n 



(\ogn) d 2 
d-3 



-m 



(\ogn) d ~ 



(28) 



for some constant 773 > 0. For d = 4 this is as far as we will need to project before applying 
the argument from Case 2. For that argument to work, and so we can continue to project down 
to lower dimensions for d > 4, we will need an estimate for the maximum number of vertices in 
^(df n (T 1/2) that can map to a single vertex when the set is projected onto a plane of one 
less dimension. This will be easy by again using Markov's inequality and the moment generating 
function for a binomial random variable. For any 1 < i\ < i 2 < • • • < id-2 < d: 



^3 , ■ • ■ , ^i d _2 1 ^d) S.t. 



V, 



<[|ai n d 1 exp [— log 2 n] ■ Eexp 



^::;fe) £d)n ^ /2 ( T v 2 ) 



= Y[ a i nd 1 CX P [ — l°g 2 n ] 

i 

<Y[cii n d ~ 1 exp 

i 

< exp [—774 log 2 n 



l+^(e-l) 
n 



o 1 



i d-2 



% nAA d (T 1 y 2 ) 



2 ^Cn^logn)^ 

— log ' ( e ~ 1) 

n 



(29) 
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The third and fourth lines above are conditional on the complement of the event in Lemma 3.1 and 
assume that the sets Ti/ 2 have been determined accordingly. 

Now, if d > 4, we will repeat the above argument until we are within 3-dimensional Hamming 
tori. Suppose, for some k (1 < k < d — 4), that 



' (d-k+l,...,d) 



d) n B d-k+l ( B 



"(2k-l)/(2fc) f(2fc— 3)/(2fc-2) 



d-fe + 2 
P'f 



> N 



(30) 



for all (id-k+i, ■ ■ ■ , d-d) where N = uj(n log 3 n), and that 



V. 



(ii, 



■'^d-fc-i'^-fc+i,---,' 
id-k-itd— k+l,...,d) 



n B~j 



d-k+i <b*7 k +' 2 (• • ■ (Bt (T 1/2 ))---) <log^n (31) 

°(2 fc -l)/(2 fc ) V P(2 fc -3)/(2 fc -2) V V Pl/2 V i/2 " ' ~ & V ' 



jd-fc+2 



for all 1 < i\ < ■ ■ ■ < id-k-i < d — k + 1 and (i^ , 



.D^d-fc+i, ■ ■ ■ Jd)- The first inequality 



above says that the sets Y 1 / 2 each have at least N (most recently occupied) neighbors in each of 
the Q(n k ) (d — fc)-dimensional sub-Hamming tori after newly occupied vertices have been sprinkled 
successively in the directions e^, . . . , ed—k+i- The second inequality says that no line contains more 
than log 2 n of the most recently added occupied vertices. Under these two assumptions, and by the 
Pigeonhole Principle, we have that the size of the projection onto the span of e±, . . . , e.d— k— 1 is 
bounded by: 



■T>(Ld-k, 

1 (d-k,. 



,..M n N d-k {B d- k+ i ( . . . {B d (T } } . . . ) 

■■■' a > P(2fc-l)/(2fc) F l/2 ' 



> 



N 
log 2 n 



for any (£d-k, ■ ■ ■ Ad)- Using Markov's inequality and the moment generating function for a binomial 
random variable as we did above, we have that 



3 {l d _ k ,..,,ld) s.t. 



V 



£ d - k ,....e d ) 

(d-k,. ..,d) 



n B d r k 



( B a-K+i (...IB* (T 1/2 ))---) 

(2fc + l)/(2fc+2) P(2fc-l)/(2k) P l/2 ' 



< 



exp 



fc-l) 

N 
n log 2 n 



< 



N 



n log 3 n 
(32) 



for some constant 7?2fc+i > 0. Similarly, for all 1 < %x < * • • < id-k-i < d — k and all t 



v, 



d—k l r>d— k +l 



(ii,...,id-h-2,d—k,...,d) 

which implies that 



nJ\f (B , 



(■■■(B d ,(T 1/2 ))---) 



J (2k-l)/(2k) 



< log n, 



3 t s.t. 



V 



(ii 



,id-k-2,d—k,...,d) 



n s d r fe 

p, 



(Jjd— fe+1 



(2fc + l)/(2fc + 2) ^(2fc-l)/(2fc) 

< exp [-??2fe+2 log 2 n] . 



(•-(Sj /2 (T 1/2 ))...) 



> log nj 
(33) 



Putting together inequalities ( 28 1 through ( 33 ) inductively gives (for d > 4) : 
3 i s.t 



■T){ii, — 4d) d4 

v {A,...,d) ni v 



(Bp 

(2d-7)/(2d-6) P(2d-9)/(2d-8) 



(•••(5j /2 (T 1/2 ))...) 
< exp [-rj2d-5 log 2 n] . 



< 



(log n) 



4d-13 



(34) 



Here, I — (£4, . . . ,£d), and henceforth I will be the vector in the superscript of the next "V" . We 
used the union bound to control the probability above, and the constant 77^-5 > is chosen such 
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that the sum of the probabilities of the events involved is bounded asymptotically by the above 
expression. These events are still conditional on complement of the event in Lemma |3.1| Likewise, 
we have: 



3 £ s.t. 



^(l,2,4,...,d) n B P 



(B 5 p , (•••(R d ,(T 1 ))--.) 
< exp [-?72d-4 log 2 n] , 



> log n 



(35) 



and 



3 £ s.t. 



V. 



ta,e 3 ,U,...,£ d ) 
(2,3,4,...,d) 



nBt 



(2d -6) ^(2d — 8) 



(•••(S5(T 2 ))--.) 



> log n 



< exp [-?72d-3 log 2 n] 



(36) 



Within each 3-dimensional Hamming torus, "P^ i '"'^ d \ we can now apply the argument from 
Case 2. This time we will not need Lemma 13.51 because we have a better bound on the number 



of (most recently) occupied vertices in any line. Fixing (£±, . . . ,£d), if we project the vertices most 
recently added to Ti onto the plane spanned by e± and e 2 , then inequalities (34) and (35) imply 
(by the Pigeonhole Principle) that with high probability 



> 



(37) 



(logn) 4 ^ 11 ' 

Likewise, if we project the vertices added most recently to Y 2 onto the plane spanned by e 2 and 
and apply the Pigeonhole Principle with inequalities (34) and (36), then the size of the projection 
within each 3-dimcnsional sub-Hamming torus is at least: 



(l,4,...,d) 



n7V 1 (s5 (• • • (B d p , (T 2 ) ) • • • ) 



> 



(log n) 



4<2-ll ' 



(38) 



Now, as in Case 2, we will condition on the last two inequalities, and sprinkle new vertices 
independently in the es direction from B p , (• • ■ (B p , (Ti) ) ■ • • ) and in the e± direction from 

B w (' ' ' ( B w (f 2) )•••)■ The key observation here is that v± and t> 2 will be in the same compo- 
ncnt if 

^ B k^ ( - • • ^ (Ti) } • • • } n ^ B k^ ( - • • ^ (T2) } • • • } * - 

That is, if there exists a vector (^1,^3,^4, . . . ,£d) such that "p^-^^'^M contains a vertex in 
B l'„ .,(••• ( B i ( T i) ) • • • ) and a vertex in ££, (• • • (B^ (T 2 ) )•••). If we define the following 



P(2d-S) 

events 



l> 2 ' 



(<l/3/4, 



n T 3 



(l,3,4,...,d) 



^0} 



ain 0.3?! 



LI LI E (ti,t3,u,...,td) 

£ 1= 1£ 3 = 1 

■ LL U 



1=1 
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then E is contained in the event that v± and v% are in the same component. To estimate the 
probabilities of these events, let us temporarily fix (£4, . . . ,£d), and define the variables: 



# {t* € {1, . . . , « 2 n} : „ G H i# (• ■ ■ (i% (T 2 ) ) • • • )} 



1.(2) ._ 



In other words, fc^ is the number of columns, 



(1,4, 



which contain at least 



one vertex in (• • ■ (B^, (Ti) )•••)■ Similarly, fc^ 3 is the number of rows, V { 



V 



{t 3 ,ti,...,e d ) 

(3,4,...,d) 



(2d-7) 



(2,3,4,. ..,d) 



c 



, which contain at least one vertex in B\ (• • • (B c \ (T2) )■••)■ We can now estimate 

"(2d — 6) "2 



the probability of E c , conditional on the complements of the events in inequalities (34) - (36 1. 

t-WN / . . t< 2 >\ 



( £ (dA,'4,.Jd: 
p (s; 



1 



1 



1 



P(2d-5) 



7 («4,.../d) 



ain 0,371 

* nn 



1-1-1- 



1 



-(2d-5) 

2n 



1 



P(2d-4) 



1-1- 



£(2c2-4) \% 



2?) 



s n n [1 - ^^^ w 

< exp 



1 = 1 ^3 = 1 

e 



ain 0371 

(2d-5)£(2rf-4) y> fc (i) k {2) 



< exp 



exp 



16n 2 

e (2d-5) e (2d-4) 

16n 2 

e (2d-5) e (2d-4) 

16~~ 



(logn) 4d - u 
1 



(£ c ) < exp 



(logn) 

e (2d-5) e (2d-4) Ilf= 



8d-22 



n d-3 



4 "-i 



16 



(logn 



|8d-22 



(39) 



In the second line of the above inequalities, the probability on the left is asymptotically less than 
the quantity on the right because p[ = ei/n + 0(n 2 ). The third line is obtained from inequality (26 1, 
which holds here because k^ , k^ = o{n) with high probability from inequalities ( 35 ) and ( 36 ) . The 
fourth line is an application of the bound 1 — x < e~ x . The fifth line is from the definitions of k^p 



(2) 

and k; ' the observation that 



«3 = 1 



(2) 



(1,4,... 



f 3 n^(^ 2d _ 6) (...(^(T 2 ))...) 



and inequalities (37) and (38). In the last line, we merely took the intersection of the events, 
E?^ td y over all (£4, . . . , £d), which are independent. When combined, inequalities (34) - (36), 
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(39), and Lemma 3.1 imply that for d > 4, the probability of i>i an d t?2 not being in the same 
component is at most 0(n~( d+1 )). This completes the proof of Lemma 



3.4 



3.4 The Size of the Giant Component 



□ 



To complete the proof of Theorem |1.2[ we need to demonstrate that the proportion of remaining 
vertices included in the giant component approaches (1 — q) > in probability. To this end we will 
prove Lemma [3T6| but first we will define q. 

Recall from earlier that qi — P (Z t = for some t \ Zq = ej) is the extinction probability for 
a multitype branching process in which for any k = 1, ...,ef, (Z\ \ Zq = e&) ~ Poisson(Aaj) if 
j 7^ k and [Z\ \ Zq = e&) = 0, and initially there is one individual of type i. The initial vertex 
in the cluster discovery process gives birth to Binomial (a^n, — ) neighbors of type i for each i, and 
henceforth proceeds like the multitype process in which no vertex can give birth to its own type. 
The limiting branching process is one in which each binomial birth event is replaced with a Poisson 
birth event with the same mean. If we consider (d— 1) independent copies of this Poisson multitype 
branching process with the modified initial step, and we define q to be the extinction probability of 
one of these copies, then the collective process will have the same distribution for all time as the 
multitype branching process that starts with one individual of each of the d types. This implies that 
L~L 1i = Q ■ From the theory of multitype branching processes [2], the vector (<?i, . . . , qj) is the 
solution to f(x) = x for x e (0, l) d , where 



fi(x) = exp 



-Xy^ajil-Xj) 



3& 



Thus, we have implicitly defined q < 1. 
Lemma 3.6. 



#{v&V : \C V \ </31ogn,^ = l} 



► q in probability. (40) 

Our approach in the proof of Lemma |3.6| will be to show that the probability of a vertex being 
included in a component of size at most ftlogn given that the vertex is occupied approaches q. 
Then we will apply a second moment method argument to demonstrate that the actual proportion 
of occupied sites in small sized components approaches q. This will require showing that the events 
{\C V \ < /31ogn} are asymptotically uncorrelated. 



Proof of Lemma \3.6[ First we will show that P (|C„| < (3logn | = 1) — ^ q. Let Z t be the multi- 
type branching process with 



(Z( +1 | Z t = e k ) 



Poisson(Aaj) j ^ k 
j = k 



for k — l,...,d and t > 1, but with Z{ ~ Poisson(Aaj) for all j = 1, ...,d. Then let T := 

1 + || Z t || 1 be the total size of this branching process, and let r := inf {t | || S^tH ^ = 0} be the total 

t>i 

size of the upper bounding binomial branching process (recall that St is the random walk version 
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of this branching process). Also, let the Poisson branching process be coupled with the binomial 
branching process so as to minimize their total variation distance, dTv{Zi>S{) < [3]. Thus, 

P (T < /3 log n,T ^ t,t < plogn) < aiA ^ lo S" ; ( 41 ) 

n 

P(T < /31ogn,r > /31ogn) < " 1 ^ 1 ° g ' 1 , (42) 

since in both events the two branching processes must differ in at least one of the first (3 log n birth 
events, and the probability of any birth event differing in the two branching processes is bounded 
by the total variation distance of their distributions (here we are referring to a randomly distributed 
number of children of a single type as a 'birth event'). From the above discussion, we know that 
P (T < oo) = q. So we have: 

V(\C V \ < /3logn | =1) > P(r</31ogn) 

> P(r</31ogn)-2 aiA/310g " 

n 

-> Q, 

which implies that liminf P (|C„| < /31ogn | = 1) > q. For the upper bound, we want to know 
that up to time s = f3\ogn, the processes At and S t are identical with high probability. Let 
T4 := inf {< I >4. t = 0}. Recall that the number of extra births in St that are lost in At up to time 
sATa can be stochastically bounded above by ~ Binomial(<i/3if (log n) 2 , ^), as per (21 1. Since, 
for n > 2\d(3K(\ogn) 2 , 



— / \dpK(\ogny 



< E 

fc=i 



< 2Ad/3if(logn)^ (43) 

the probability that the two processes will differ by time s is small. Now for the upper bound: 

P {\C V \ < /31ogn I = 1) = P {As = I & = 1) 

< P (A = 0, A = S t W < s | = 1) 

|P(A/ S t for some i < s | = 1) 

< P (S s = 0, At = S t Vt < s | & = 1) + p (V (s) > 1) 

2Ad^(logn) 2 



< P (t < /3 log n) + 



<P(T< /3 l gn) + 2 a ^ 1 °g" + 2A ^ 10 ^) 2 

n n 



The fourth line above uses the estimate in (43) and that S s is independent of the vertex v, and 
the fifth line uses the total variation distance bounds that we found earlier for the two branching 
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processes. Thus we have shown that limsupP (\C V \ < f3\ogn \ = 1) = q, and so 

lim P (|C| </31ogn | & = 1) = q. (44) 

n— > oo 

Now in order to complete the proof, we will apply the second moment method to the following 
random variables: 

'l if \C V \ < plogn, = 1 



H„ = 



else. 



Note that 



Y,H v = #{veV : \C v \<p\o E n, = 1} 



J2EH v = xlY[a i )n ( - d - 1 

nev V i / 



>P(|C„|</31ogn| &, = !), 



so we wish to prove that 



lim jr gyg " = 1 



in probability. To apply the second moment method we need to bound the variance of the sum of 
the H v 's, and to do this we need an upper bound on the covariance, EH V H W — (EH V ) 2 . We begin 
by observing that 

EH V H W = p 2 P (|C| < /3 log ra, \C w \<piogn | = 1, £„ = 1) . (45) 
If w £ Af(v) then 

P(|a| </31ogn, \C w \</3logn | &, = 1, £ w = 1) = P (\C V \ < /3\ogn | & = 1, = 1) 

<P(|C B | </31ogn |& = 1). 

Let us assume now that w ^ and we will denote by J„ C V such that « £ J„ an arbitrary 

set of vertices containing the vertex v. Then 

P(|C,| </31ogn, \C w \<0logn\^ v = l, £«, = !) 



^ ^ ' P {G v — J v , C w — J w | — 1, t; w — 1) 

|J«|<y9IogTi |J„|<j91ogn 



|7„|</91ogn |J„|</31ogn 
J„,rW(./„)=0 



^ ' P (Cu — J v — C w \ ^ v — 1, £ w — 1) . 



(47) 



|J„|</31og« 
Jv3w 



since if J w fl M{J V ) ^ then the event that C w = J w has zero probability unless J w = J v . We 
will bound lines ( 46 ) and ( 47 ) separately. Beginning with line ( 46 ) , we condition on the event that 
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C v = J v to obtain: 



y ' y ' IP (C-u — Jo | £o — 1) £tu — 1) P (C TO — J w l^-o — l, S, w — 1) C« — Ji>) 



|J„|<,31ogn |J„|</31ogn 
J„,n/V(J„)=0 

= ^ ' P (C v = J v | = 1) y ' P (C w = J id I £«, = 1, C„ = J„) . 

|J„|<£logn |J„|<^logn 

The second line is because when J w n AT(J V ) = 0, the event that C v = J v is independent of 
£ w , and the event that = 1 is contained in the event that C v = J v . Now, if C v — J v and 
J w C\N(J V ) — then we know that the vertices in Af(J w ) (lftf(J v ) must not be occupied, otherwise 
C v would have to include J w . Additionally, these are the only vertices that both of the events 
C v = J-,, and C w — J w depend upon, otherwise these two events use independent vertices. Noting 
that \Af(J w ) nN"(J v )\ < 2 | J„| |Ju,|, since each pair of non-neighboring vertices has at most two 
common neighbors, we have that (46) is bounded by: 



P (C w — J w | £k, — 1) 

A s 2/32 (log n)2 



< y, p {Cv = j« i = i) E . . , 

\J v \<f3\o & n \J w \<piogn l 1- ^) 

J w nM '(J„)=0 

/ 1+ 5A/3 2 (logn) 2 \ ^ P(a = J.|e. = l) E P(C» = ^I^ = 1) 
^ '' / |j^|<^i og „ |^|<^logn 

5A/? 2 (logn) 2 ^ ( EiJ») 2 

p2 



(48) 



In the first line above we used that the event that none of the vertices in JV(J w )Dj\f(J v ) are occupied 
is contained in the event that C m = J w . In the second line we applied the following computation, 
in which the inequalities hold provided n > 2A and n > 32A/3 2 (log n) 2 : 

, v 2/3 a (logn) a .2^(log„) 2 



1 - - / " V n 



4A/3 2 (logn) 2 32A 2 /3 4 (logn) 4 

1 T" T" rj 

n n z 
< 1 | 5A/3 2 (logn) 2 
n 



To bound line (47 1, we observe that 



£ P (C„ = J„ = C w \ & = 1, ^ = 1) =P (|C„| < /31ogn, C„ = C w | & = 1, U = 1) 



|J„|</31ogn 



< P (A t n Af(w) 7^ for some t < f3 log n \ A = {v}) 

^ \ 2/3 log n 



< 2A/3 log n 8A 2 /? 2 (logn) 5 



7? 



< 3A/?l0gn (49) 
n 
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In words, the second line above says that the probability that C v has fewer than (3 log n vertices and 
contains the vertex w (conditional on v and w begin occupied) is at most the probability that we 
find an occupied neighbor of w by time j3 log n during the process of discovering the cluster C v . In 
the third line, we used the fact that at each time, t < /31ogn, until A t n Af(w) ^ U we have that 
\Af(v t ) H Af(w)\ < 2, and at least one such vertex must be occupied in order for w to be discovered. 
The last two lines require that n > 8A/31ogn. 



Combining (45) - (49) shows that the covariance between H v and H w when w ^ Af(v) can be 
bounded as: 



< 



6A 3 /3 2 (logn)' 



(50) 



for sufficiently large n, since EH V < p. Now we can use this to bound the variance: 



Var ( Y H v 

\veV 



Y K //,.//„. + Y EH v H ™ - ^llvEH,, 



< J2 EH vH w + J2 PE# V #«, - EH V EH W ] 



ioeJV"(t>) 



< 



A 2 (j[ a i) ^dn^ +6\ 3 f3 2 (j[ a i^ n^\\ognf 
O (n^Xlognf 



(51) 



Using this bound on the variance and Chebyshev's inequality yields: 



E ve v EH v 



- 1 



> n 



-1/3 



< 



Y Hv - Y EH - 

vev vev 

Var (J2vev H " 



> 



X {\{ a ^j V (\C V \ < plogn | ^ = 1)^ 1/3 ^) 



A 2 (I], a t f (q + o(l))2 n (2rf-2-2/3) 
O fn" 1/3 (logn) 2N 



Combining (44) and (52) completes the proof of Lemma 3.6 



Thus the proof of Theorem 1.2 is complete 



(52) 

□ 
□ 



4 Connectivity Threshold 

We have demonstrated that when p — X/n, a giant component emerges at A c . The purpose of this 
section is to determine when the random site subgraph of the Hamming torus is connected. We 
prove the following theorems. 
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1.5 



Theorem 



1.6 



restated). Let c < J=j^7- If P — p{n) < c 1 ^ s - and p — u{n d ) then the random 
the Hamming torus contains isolated vertices, and is thus not connected (a.a.s.). 

restated). Let c > yT 1 . // p — p(n) > c l °f - then the random site subgraph of the 

VP 



Hamming torus is connected (a.a.s.) 

restated). Fix a\ > 02 > • ■ ■ > ad, and let c > 



d-l 



c log n 



then 



every vertex in the random site subgraph of the Hamming torus is either isolated or belongs to the 
giant component (a.a.s.). 

These three theorems together imply that, with probability approaching one, the random site 
subgraph of the Hamming torus is connected if and only if it contains no isolated vertices (except 
in the trivial case p x n~ d , where the subgraph may consist of just a single vertex with positive 
probability) . 



Proof of Theorem \l-4\ We denote by d v the degree of the vertex v in the random site subgraph of 
the Hamming torus, with the convention that d v = — 1 if £ v = 0. Let be the number of isolated 
vertices in the random site subgraph. Then 



EI n =pJ2v(d v =0\ & = 1) 

(J[a i )n d (l-p)^ ain - d 



am 



b n " means that < lim |^ < 00. The last line above follows from the limit 



P 

xpn d exp \-p^2 

where "a 

lim (l-p)"e pn 

n— >-oo 

provided p = o(n -1 / 2 ). If ri -1 < p < cl °& n ; then equation (53) is bounded by 



(53) 



1 



00. 



Otherwise, if n d+1 <C p < n 



then equation ( 53 ) yields 



— > 00. 
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We will now use the second moment method to show that /„ stays close to its mean. 
vev wgv 

= ^] P (d„ = 0) + V(d v = 0, d w = 0) 

d(v,w)=l 

+ J2 P ( d « = °> d ™ = °) + £ P (d« = 0, d«, = 0) 

cZ(v,uj)= 2 d(u t iw)>3 

= EI„ + + Y P(d v =0\ d w = 0)¥(d w =0) + Y P ( d « = °) P ( d ™ = °) 

d(v,w) — 2 d(v,w)>3 

=EIn+ E Pw^a^a + E P((l ._o)F(*.-o) 

2 d(v,ii?)>3 

In the second line above, the second sum is equal to because if v and w are neighbors, then they 
cannot both be isolated. The last sum in the second line is equal to the last sum in the third line 
because if d(v, w) > 3, then the events {d v = 0} and {d w = 0} rely on disjoint sets of vertices and are 
thus independent. In the case where d(y, w) = 2, these events share exactly two vertices in common, 
thus P (cf„ = 0) = P (d„ = | d„, = 0) (1 -p) 2 . Let N k = #{v G V | d((l, 1, . . . , 1), v) = k} be the 
number of neighbors at exactly Hamming distance k that a vertex in the Hamming torus has. Then 

Ell = EI n + \V\N 2 V ^ V + \V\(\V\-N 2 -N 1 -N )¥(d v = 0) 2 

\ 2 / -i . k \ , itaI l\tr\ at \ ire /j n\2 



<E/„ + \V\N 2 ¥(d v = 0y (l + 5p) + \V\(\V\-N 2 )¥(d v =0Y 

\V\N 2 ¥(d v =0) 2 4 
< (l + (3(log 2 n))E/„ + (E/„) 2 , 



= EI n + 5p\V\N 2 ¥{d v =0) 2 + |vf P(c4 = 0) 2 



Var(J„) = 0(log 2 n)E/„. (54) 

In the second line above we applied the estimate (1 — p)~ 2 < (1 + 2p) 2 < (1 + 5p) whenever p < 1/4. 
In the fourth line we used the following facts: EI n = \V\ ¥ (d v = 0), P (d v = 0) < p < and 
N 2 = 0(n 2 ). We can now bound the fluctuations in I n by 

I n -EI n \ > (EI n y/ 2 \og 2 n) < J ai(I i = 0((logn)- 2 ) . (55) 

/ EI n log n 



Since EI n 3> n c ^> log 4 n for some e > 0, this completes the proof of Theorem 



1.4 



□ 



The proofs for Theorems 1 1 . 5| and 1 1 . 6| will be similar in that they appeal to the machinery that we 



have already developed in the proof of Theorem 1.2 In particular, we will merely replace Lemma 3.3 



with a more assertive statement, then apply Lemmas |3. 1[ |3.4] and the arguments of Section [3. 1| with 
little modification. 
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Proof of Theorem 1.5 Let p 



c log n 



, and observe that monotonicity implies the result for larger 



p. Recall how we coupled a lower bounding random walk, Wt, with A' t in Section 3.1 We wish to 
construct the same lower bounding walk here, and we will even use the same parameter pi = — where 

Ai is a constant such that A c < Xi < \i d . The only difference here is th at n ow p = Pi 
where P2 > 6 los - , which is okay because all we need to apply Lemma 



equation ( 13 



3.4 



-P2 



is that P2 > 



P1V2 
We 



then can apply Lemma |3.1| and repeat the arguments at the end of Section |3.1| to again obtain 



which says that, provided \A! \ > to — 



and \R' \ 



o(n 2 / 3 ), 

,«J-4/3 



the process of discovering occupied vertices with parameter p\ will survive until time r = n" 
At this point, we can apply Lemma |3.4| to say that with high probability every cluster reaching size 
r will connect into a single giant component. Thus, to show that the entire subgraph is connected, 
we need to show that either \C V \ > to or \C V \ =0 for every v 6 V with probability approaching 1. 

Consider a fixed vertex, v. Disregarding prior notation for the moment, let A^ be the set of 
vertices in the random site subgraph of the Hamming torus to which there exists a unique shortest 
path of length k from v. Then for any constant £ 



\ (|A 2 | < m I \A X \ =1) =P p ( 



e lA2l > e- m 



\Ai\=i) 



< e m E z 



= e m [! _ p(1 _ e -fWW.)l 

< exp [m-p(l- e- 1 ) \Af(Ai) \ M{v) 

d 

n Incf n / . 

< exp to - 

\i=2 
d 



exp 




72 



(l - e" 1 ) J^fli logn + c(l - e- x )^ + d- 1 



log n 



i=2 



(56) 



In the fourth line above we used the bound (1 — x) < e~ x . In the fifth line above we assumed 
WLOG that ai > a,2 > • • • > a^, and used the fact that each of the £ vertices in Ax has at least 
(da + • • • + a,d)n — £ — d + 1 neighbors that are not also neighbors of v or any other vertex in A\. 
Now, if we choose £ to be any constant so that 



then ( 56 ) implies that 



72 



1 d 

- - ic (l - e- 1 ) ^ < -(d + 1) 



£> 



(d+l)(l + l/ 72 ) 
c(l- e _1 )E 4 d = 2 «i' 



P P (|A 2 | <to I L4i| = i) = O (n-< d+1 >) 



(57) 



(58) 



Let ^4 be the multitype branching process with Z\ ~ Binomial(ain,p) for all i = 1, . . . , d, and for 
all t > 1, (Z( +1 I = ej) ~ Binomial (a,jn,p) for all i and j 7^ i, and {Zl +1 \ Z t — e») = 0. Observe 
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that \Ak\ < ||-Zfe||i by a now familiar coupling. For = log 5 « 1.6 
\Zi ||i > 5^a lC logn) < e" 59 ^ a ' clog ™Ee e ll Zl "i 



< exp -56»^a 4 clogn + (e e - l) ^ 

< exp (e 9 — 1 — 5$) ^ a^c log nj 

< exp [-4(d - 1) logra] 
= 0(n-( d+2 )). 



(59) 



Now, applying a union bound, 

l^alli > (^^aiclogn] \\Zx\\ x < 5 OjC log 



< (s^aiclognjpfjlZxIl! > 5^a lC lo g) 
= 0(n-( d+1 >). 



(60) 



Combining the last two inequalities shows that |Ai| and \A2\ are both o(n 2 ' 3 ) with high probability. 
So we can apply Lemma 3.1 with A' = A2, R' a = A\ U {v} and [To = V \Af(R' a ), and continue 
the arguments discussed above to show that all vertices for which d v > I will be in the same large 
component with probability exceeding 1 — 0(n _1 ) (by a union bound over all vertices in V). Thus, 
we need to show that all vertices either have d v = — 1 or d v > I. 



{d v <t\d vj t -l) = ]T 
3=0 

e-i 

3=0 



J2at n - d 
3 



clogn^"' / clogn 



E mn—d—j 



(c log n) J 



Yl a i c l°g n-\-(d-\-j)c log n/n 



< iiclognf- 1 n-^ CHC e {d+l - 1)closn/n max 



US' 



= 0(n-^° 4C (logn) / - 1 ). 
So, since et;c > d — 1, we have that 



(0 < d v < i) = 0(i 



-EoiC- 1 



(lognr) = (n- rf ), 



and by the union bound 



P (0 < d v < i for some v e V) = o(l). 



This completes the proof of Theorem 1.5 



□ 



Proof of Theorem 1.6 The proof here will be very similar to that of Theorem |1. 5 [ except that here 
we will show that with probability approaching 1 one of three things can occur for each v € V: 
\C V \ — 0, I Co I = 1 or \C V \ > m. The last case implies, by the discussion at the start of the last 
proof, that v is in the giant component. We will also rely on the same notation from the last proof. 
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If d v < then v is either isolated or not in the subgraph, so we consider the cases where d v > 1. 
If d v > £, where £ is defined according to (57), then equations (56) and (58) still hold, as well as the 
arguments that follow, so v will be in the giant component with probability approaching one. Now 
we must consider the case where 1 < d v < I. Suppose d v = I where 1 < I < £, then 



\A 2 \<t, d v =l) 



P l (l- P ) 



E 



a t n-d-l S^( mAl)XAf{V)l )pl( 
3=0 \ 3 J 



1-p) 



|Af(ii)\Af(r-)|-j 



3+1 (clogn) J++1 ((l+1) yf_ n 04+0l )_i (j+Z(J+d-l))clogn/r 



3=0 



J! 



= o(n~ d ). (61) 

The second line above is because each of the I vertices in Ax have at least (02 + ■ ■ ■ + a,d) n — l — d+l 
neighbors that are not also neighbors of v or any other vertex in Ai. The last line above is because 

c > ^-^d d l — • Summing the above probabilities implies that P (\A 2 \ < £, 1 < d v < I) = o(n~ d ). 

So for every v G V, if d v < 0, then v is either isolated or not in the random subgraph. If d v > £, 
then v is in the giant component with probability approaching one. The probability of none of the 
last three cases occurring and v having fewer than £ occupied vertices at distance 2 approaches 0. 
To complete the proof, we merely need to show that if v has £ occupied vertices at distance 2 then v 
will have at least m vertices at distance 3. This is easy because the arguments are identical to those 
of inequalities (56), (58)-(60), but with \A 2 \ and \Ai\ replaced by \A 3 \ and \A 2 \, and £ replaced by 
A£ in inequality (56). This last substitution is because if |^4i| < I and \A 2 \ = £ then each vertex in 
A 2 has at least (a 2 + ■ ■ ■ + ad)n — A£ — d + 1 neighbors that are not also neighbors of v or any other 



vertex in A\ or A 2 . This completes the proof of Theorem 1.6 



□ 
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